A New Simultaneous Two-Levels Coclustering Algorithm for Behavioural Data-Mining

Authors

  • Guénaël Cabanes
  • Younès Bennani
  • Dominique Fresneau
Abstract

Clustering is a powerful tool for automatically detecting relevant sub-groups in unlabeled data sets. In addition to clustering the data, it is sometimes useful to group and visualize the attributes used to describe them. In this paper, we propose a coclustering algorithm based on the learning of a Self-Organizing Map. The new algorithm simultaneously maps data and features into a low-dimensional subspace, which allows simple visualization, and produces a clustering of both data and features. The resulting output is therefore informative and easy to analyze.

Similar resources

Cocluster analysis of thalamo-cortical fibre tracts extracted from diffusion tensor MRI

As the central relay station of the human brain, the thalamus modulates sensory signals to and from the cerebral cortex. The reciprocal connectivity between the cerebral cortex and the thalamus is believed to play an essential role in consciousness and various neurological disorders. Thus, in-vivo analysis of thalamo-cortical connectivity is important for our understanding of normal and patholo...

Calculation of One-dimensional Forward Modelling of Helicopter-borne Electromagnetic Data and a Sensitivity Matrix Using Fast Hankel Transforms

The helicopter-borne electromagnetic (HEM) frequency-domain exploration method is an airborne electromagnetic (AEM) technique that is widely used for vast and rough areas for resistivity imaging. The vast amount of digitized data flowing from the HEM method requires an efficient and accurate inversion algorithm. Generally, the inverse modelling of HEM data in the first step requires a precise a...

A New Algorithm for High Average-utility Itemset Mining

High utility itemset mining (HUIM) is an emerging field in data mining that has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds a minimum threshold. The basic HUIM problem does not consider the length of itemsets in its utility measurement, and utility values tend to become higher for itemsets containing more items...

Symbolic Representation of Time Series: A Hierarchical Coclustering Formalization

The choice of an appropriate representation remains crucial for mining time series, particularly to reach a good trade-off between the dimensionality reduction and the stored information. Symbolic representations constitute a simple way of reducing the dimensionality by turning time series into sequences of symbols. SAXO is a data-driven symbolic representation of time series which encodes typica...

New Approaches to Analyze Gasoline Rationing

In this paper, the relation among factors in the road transportation sector from March 2005 to March 2011 is analyzed. Most previous studies take an economic point of view on gasoline consumption. Here, a new approach is proposed in which different data mining techniques are used to extract meaningful relations between the aforementioned factors. The main and dependent factor is gasolin...

Publication date: 2011